Studying Translationese at the Character Level

نویسنده

  • Marius Popescu
چکیده

This paper presents a set of preliminary experiments which show that identifying translationese is possible with machine learning methods that work at character level, more precisely methods that use string kernels. But caution is necessary because string kernels very easily can introduce confounding factors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Translationese Traits in Romanian Newspapers: A Machine Learning Approach

This paper presents a machine learning approach to the investigation of the translationese effect on Romanian newspapers texts. The aim is to train a learning system to distinguish between translated and non-translated texts. The classifiers achieve an accuracy well above the chance level, the results confirming the existence of translationese manifestation. Also, the experiments investigate wh...

متن کامل

On grammatical translationese

In most published work on bilingual data from aligned corpora (variously called parallel corpora, translated corpora, bi-texts or corpora of parallel texts), it is the lexicon that has been studied (cf. Klavans and Tzoukermann (1990), Church and Gale (1991), Marinai et al. (1991), Isabelle et al. (1993)). In this paper, I am concerned with corpus-based contrastive studies of grammatical feature...

متن کامل

Quantitative Analysis of Translation Revision: Contrastive Corpus Research on Native English and Chinese Translationese

Demand for Chinese-to-English translation has increased over recent years. In contrast, resources for training translators for Chinese-to-English are few although increasing now, relative to English-to-Chinese for example. Corpus-based techniques are now more widely acknowledged as being appropriate for the study of translation. A number of Chinese/English parallel translation corpora have been...

متن کامل

Cognitive Character Engagement and the Extended Mind in Bahram Tavakkoli’s Wandering in the Fog

The present study has a cognitive approach to empathy and character engagement in cinema. In discussing how through empathy and engagement with real and fictional characters, we extend our minds and enhance our cognitive abilities, the present paper studies Bahram Tavakkoli’s Wandering in the Fog (2010), to show how Andy Clark and David Chalmers’s idea of the extended mind as joined with ...

متن کامل

A Parallel Corpus of Translationese

We describe a set of bilingual English–French and English–German parallel corpora in which the direction of translation is accurately and reliably annotated. The corpora are diverse, consisting of parliamentary proceedings, literary works, transcriptions of TED talks and political commentary. They will be instrumental for research of translationese and its applications to (human and machine) tr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011